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ABSTRACT 

We measure the two-point spatial correlation function for clusters selected 
from the photometric MaxBCG galaxy cluster catalog for the Sloan Digital Sky 
Survey (SDSS). We evaluate the correlation function for several cluster samples 
using different cuts in cluster richness. Fitting the results to power-laws, ^cc{r) = 
(r/i?o)~^! the estimated correlation length Rq as a function of richness is broadly 
consistent with previous cluster observations and with expectations from N-body 
simulations. We study how the linear bias parameter scales with richness and 
compare our results to theoretical predictions. Since these measurements extend 
to very large scales, we also compare them to models that include the baryon 
acoustic oscillation feature and that account for the smoothing effects induced 
by errors in the cluster photometric redshift estimates. For the largest cluster 
sample, corresponding to a richness threshold of iV2oo > 10, we find only weak 
evidence, of about 1.4 — 1.7a significance, for the baryonic acoustic oscillation 
signature in the cluster correlation function. 

Subject headings: cosmologyiobservations - large-scale structure of the Universe 
- galaxiesxlusters - SDSS 
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Introduction 



Galaxy clusters hav e long been recognized as powerful cosmological probes (IBorgani 



20061 : IRozo et al I l2007al ). In particular, measurement of the cluster mass function vs. 



redshift constrains cosmological parameters, including those associated with dark energy 
jWang et al"~lll998l : iHaiman et al II2OO1I : bladders et al. II2OO7I : IRozo et al. 1l2007bl : iMantz et al 
2007h. This has m otivated the design of new wide- area cluster surveys in the optical 



flAbbot et al. 112007 ). X-r av flRapetti et al. 



20061 ). and using the Sunyaev-Zel'dovich effect 



(SZE) (IRuhl et al. II2004I : ISeghal et al II2007I ). The utility of this probe hinges on limiting 
the uncertainty in the relation between cluster mass and whatever observable (e.g., optical 
richness. X-ray luminosity, or SZE flux decrement) is used as a proxy for it. Measurement of 
the two-point correlation function of clusters can help calibrate such mass-observable rela- 



tions and thereby improve the resulting cosmological constraints flMajumdar fc Mohr 112004 



Lima fc Hull2005l ). 



Since clusters are the largest virialized mass concentrations in the Universe, measure- 
ment of their spatial clustering also provides insight into models of large-scale structure 
formation and tests theoretical frameworks, such as the Halo Model, that describe the rela- 



tion between__the_gala^^ the bias (IRozo et al. 



2OO4J : lYounger et al. II2005I : ISmith et al. Il2007bl : ISchultz et al. II2OO6I : ISmith et al. Il2007ah . 



On very large scales, r ~ 100 h ^ Mpc, the two-point correlation function or power spec- 



trum of clusters should show evidence of baryon acoustic oscillations (BAO) flAngulo et al 
2OO5I ). In concert with measurements of the cosmic microwave background anisotropy, the 



BAO sc ale provides an estimate of cosmic dis tance and thereby a geometric probe of dark 
energy (jSeo et al I l2003l : iHu fc Haiman II2003I ). A possible detection of BAO in th e powe r 



spectrum of clusters from the Abell/ACO catalog was reported in iMiller et al. I (120011 ) 



while earlier studi es had claimed evidence for a feature in the cluster correlatio n function at 
r ~ 125 h-^ Mpc f lXopvlov et "al~lll998l : IMo et al. 1^99^ : lEinasto et a"l~lll997a I ffll 



The BAO feature was detected at 3.4(T significance in the two-point correlation function 
of ~ 47,000 luminous red galaxies (LRGs) with sp ectroscopic redshifts in t he range 0.16 < 
z < 0.47 in the Sloan Digital Sky Survey (SDSS) (lEisenstein et al. II 20051 ). Using slightly 



large r samples, the BAO feature was also detec ted in the LRG power spectrum (IHuetsi 



20061 : iTegmank et al. II2OO6I : IPercival et al. Il2007n . 



The BAO feature has also been inferred from the large-scale clustering of ^ 600, 000 



LRG s identified in the deeper SDSS photometric survey (IBlake et al. 1120071 : iPadmanabhan et al. 
20071 ). Although the photometric catalog covers a larger volume and contains many more 
galaxies than the spectroscopic sample, the galaxy redshifts in the former must be estimated 



3 



photometrically. With a photometric redshift uncertainty of cr^ ~ 0.03 for this sample, much 
of the information in the radial modes of the power spectrum is lost, and the BAO feature 
was detec t ed at less t han 3a significance. B as ed on theoret i cal co nsiderations presented in 
Seo et al I (120031 ) and iBlake fc Bridle I (120051 ) , iBlake et al. I (120071 ) presented a direct mea- 
surement of the 3D power spectrum, with a correction for the dampi ng of power in the ra- 



dial d irection due to photometric redshift uncertainties. By contrast, iPadmanabhan et al. 



(120071 1 measured the angular power spectrum in photometric redshift slices and used it to 
reconstruct the three-dimensional power spectrum. 

In this paper, we present measurements of the two-point correlation function for optically 
selected galaxy clusters in the Sloan Digital Sky Survey (SDSS) and study the possible 
detection of baryonic o scillations. The cluster samples are derived from the MaxBCG catalog 
jKoester et al. Il2007al jbl). in which clusters are identified as concentrations of red-sequence 
galaxies; the colors of these early-type galaxies are used to estimate photometric redshifts 
fo r the clusters. The two poin t correlation function for galaxy clusters in SDSS was studied 



m 



Basicalos &: Plionis I (120041 ) using an earlier cluster catalog. 



We focus on measurement of the 3D cluster correlation function. In comparing the obser- 
vations to models of large-scale structure, the model predictions are corrected for the effects 
of photometric redshift errors. We use two different methods for estimating this correction 
which are in good agreement for the photometric redshift error cr^ ~ 0.01 characteristic of 
the MaxBCG catalog. 

The cluster correlation function measurements provide weak evidence (~ 1.4 — 1.7a) 
for the presence of BAO in the cluster spatial distribution. An independ ent measurern ent of 
the power spectrum for the same cluster sample has been presented in (IHuetsi 1 120071 ) . also 
indicating weak evidence (~ 2a) for acoustic features in the power spectrum. 

The paper is organized as follows. In section [2] we describe the MaxBCG cluster catalog, 
the samples we derive from it, and our measurements of the cluster correlation function. 
In section [3] we introduce our model for the cluster correlation function and discuss the 
corrections due to photometric redshift errors, presenting two different correction methods. 
In section Hjtwo different estimates of the correlation function covariance matrix are described 
and compared. In section [5] we compare the model to the data, extracting estimates of the 
cluster bias as a function of richness and mass, and comparing the goodness of fit for models 
with and without BAO. We present our conclusions in section [61 
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The McLkBCG cluster correlation functions 



2.1. The MaxBCG cluster catalog 



The cluster sampl es we analyze are derived from the SDSS MaxBCG catalog ( Koester et al. 

2007a ) for SDSS DR5 jAdelman-McCarthv et allboooh . The MaxBCG method JXoester et al. 
2007bl ) identifies clusters using two optical properties. First, the brightest cluster galaxy 
(BCG) typically lies near the center of the cluster galaxy light distribution. Second, the 
cores of rich clusters are dominated by red-sequence galaxies that occupy a narrow locus in 
color-magnitude space, the E/SO ridge-line. MaxBCG uses a maximum-likelihood method 
to evaluate the probability that a given galaxy is a BCG near the center of a red-sequence 
galaxy density excess. Once a list of potential cluster centers is obtained, galaxies are grouped 
around those centers and the clusters are identified. 

One measure of the richness of the clusters, denoted A^200) is defined as the number 
of galaxies on the E/SO red sequence brighter than 0.4L* that lie within a scaled ra- 
dius -R200 of the BCG, where i?2oo is the radius within which th e density of galaxies with 



24 < Mr < —16 is 200 time s the mean density of such galaxies flHansen et el. 1120051). Dv- 



namical (IBecker et a. 



2007; Sheldon et al. 



2007) and statistical weak lensing measurements (jjohnston et al 



20o3) indicate that A^2oo is strongly correlated with cluster virial mass. 
The public MaxBCG catalog contains 13,823 clusters with A'^200 > 10; the catalog is approxi- 
mately volume-limited over the redshift range 0.1—0.3 and covers 7500 square degrees. Tests 
on mock catalogs indicate t hat the MaxBCG sample should be > 9 0% pure and complete 
for clusters with A^'2oo > 10 (IKoester et al. Il2007al ; IRozo et al Il2007al ). 



We subdivide the MaxBCG catalog into four samples for analysis, using the following 
thresholds in cluster richness: A^200 ^ 10, H, 13, and 16. The corresponding virial mass 
thresholds, based on sta tistical weak lensing rn easurements, are approximately 4.3, 5.1, 6.1, 
and 7.9 x 10^^ h~^MQ ( 1 Johnston et al. 1120071 ). The resulting numbers of clusters Nc and 
the spatial number densities Uc (assuming a survey volume of 0.5h~^ Gpc^) for each sample 
are given in Table [H Note that by virial mass we mean the mass denoted by M^r in 
( 1 Johnston et al. 1120071 ); it is defined in terms of the overdensity at collapse with respect to 



the background density by the redshift- and cosmolog y-dependent formula, A 



82x - 39x2)/(l + x), with x = nm{z) - 1, derived in flBrvan fc Norman 1119981 ) for ACDM 



mr 



'187r2 



models. This definition corresponds to the mass M that appears in the halo bias formulas 
of Section 15. 3[ where we drop the subscript for simplicity. 
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Table 1: Cluster samples used in this analysis: Nf. is the number of clusters in each sample, 
ric indicates the mean number density for each sample, assuming a volume of 0.5 Gpc^. 

sample A^^^ Uc [ h? Mpc"^] 

N2QQ > 10 13823 2.8 x 10"^ 

A^aoo > 11 11265 2.3 x 10"^ 

iVaoo > 13 7796 1.6 x 10"^ 

iVaoo > 16 4853 1.0 x 10"^ 
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The redshifts for the MaxBCG clusters are estimated photometrically from the g — r 
colors of the red-sequence galaxies. Since the color locus of these galaxies has finite width, 
the cluster photometric redshifts have a non-zero dispersion around the true values. Tests 
using a subsample of the BCGs with spectroscopic redshifts indicate that the dispersion in 
the cluster photometric redshift estim ates is approximately cr^ = {{z^h — ZspYY^"^ — 0.01, 
with a small dependence on richness (IKoester et al. Il2007al ). For the > 10 sample, 
the dispersion varies from 0.006 at the lower end of the redshift range {z = 0.1) to 0.011 at 
the upper end (z = .3). The error distribution is generally well described by a Gaussian 
jKoester et al. Il2007ah . 

As we will show, for separations larger than ~ 50 Mpc, the translation of the cluster 
correlation function from real space to photometric-redshift space depends sensitively on 
the photometric redshift uncertainty, cr^. In this wo rk, for simplicity we a ssume a constant 
photo-z dispersion of az = 0.01, as suggested by (IKoester et al. 112007a] ). as our default. 
Throughout the paper, and particularly in §3.2.31 15.11 and I5.2.3[ we discuss the systematic 
errors associated with the uncertainty in a^. 



2.2. Estimation of the two-point correlation function 



We measure t he correlation fuiiction ^ (r) for each cluster sample by means of the Landy- 
Szalay estimator (ILandy fc Szalay 1 119931 ) . 



1 



RR{r) 



X 



n 



DD{r)^ - 2DR{r)— + RR{r 



n 



D 



riD 



where DD{r) represents the number of cluster pairs with separation r ± Ar/2 in the data, 
RR{r) is the number of pairs in the same separation bin in a random catalog uniformly 
distributed over the same survey volume, DR{r) is the number of pairs with one member 
of the pair from the data sample and the other from the random catalog, Hr is the number 
density of clusters in the random catalog, and tt-d is the number density of the data cata- 
log. To reduce the effects of shot noise, the random catalog is five times denser than the 
data catalog. The random catalog is generated using the same ang ular mask used for the 



weak lensing analysis of the MaxBCG catalog ( Sheldon et al. 1 120071) and with the redshift 



distribution measured in the data, see Fig. 4 in IKoester et al. I ( l2007al ) 



Due to selection effects, the redshift distribution changes slightly as a function of the 
richness threshold, so a different random catalog is generated for each sample. The separation 
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r is given in comoving coordinates, and it is obtained assuming a flat ACDM cosmology 
with Qm = 0.27. Where needed, we adopt the following other cosmological parameter values 
in this work: the Hubble parameter h = iJo/lOOkm/s/Mpc = 0.72, the baryon density 
fib = 0.046, the primordial perturbation spectral index Ug = 1, and linear power spectrum 
amplitude as = 0.9. 



3. A model for the cluster correlation function 

In this section, we develop a model for the cluster correlation function that includes 
the effects of non-linear evolution and bias ( §3.ip and photometric redshift errors ( §3.21) 
and that can be compared to the measurement of the correlation function on large scales, 
r ~ 20 — 200 Mpc, with particular attention to the region where the BAO feature is 
expected. 



3.1. Non-linear evolution and bias 

Although the rms density perturbation amplitude on large scales is much smaller than 
unity, one cannot rely on linear perturbation theory to precisely predict the large-scale cor- 
relation function, especially on the BAO scale of ~ 100 Mpc. The effects of non-linear 
evolution of perturbations on the acoustic features in the matter power spectrum and corre- 



and semi-analvtic techniaues and N-bodv simulations (Meiksin et al 19991: 


3eo et al 


2003; 


White 


2001 


: Eisenstein et al. 20071: Jeone & Komatsu 20061: iGuzik & Bernstein 


2007; 


Huff et al. 1 


2007: Crocce & Scoccimarro 


2007: Matarrese & Pietroni 200' 


^: Smith et al. 


2007a: 


Aneulo et al. 2008h. It has been recognized that the growth of structure induces a 



substantial damping of the acoustic peak in the correlation function with respect to linear 
theory; this damping must be taken into account when comparing theory with observations. 

In this work, we rnodel t his effect using Renormalize d Perturbation Theory (RPT) 
( 1 Crocce &: Scoccimarro I l2006l : ICrocce fc Scoccimarro 1 120071 ) . This prescription has the ad- 
vanta ge of being based exclusively on first principles and, as shown in ( ICrocce fc Scoccimarro 
20071 ). achieves remarkable agreement with results from N-body simulations. In general, the 
RPT non-linear matter power spectrum can be expressed as the sum of a term that accounts 
for the degradation of the initial power at a given wavenumber k and a term arising from 
the non-linear gravitational coupling of modes of different wavenumbers. 



PRPTik) = G\k- a)PL{k) + PMc{k\ a) 



(2) 
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Here G{k; a) is the RPT propagator from the initial conditions, a is the cosmic scale factor, 
Pl is the power spectrum in linear perturbation theory for the ACDM model, and Pmc is 
the contribution from mode coupli ng. We refer the reader to I Crocce fc Scoccimarro I (120061 ) 
and I Crocce fc Scoccimarro I (120071) for detailed descriptions of the computation of the power 
spectrum in RPT. As shown in (jCrocce &: Scoccimarro 1120071 ). the first term in Eqn. ([2]) is 
primarily responsible for the damping of the acoustic peak in the correlation function, while 
the second term gives a small correction of the order of a few percent at the relevant scales. 
We neglect the latter term in this analysis, since its contribution is subdominant compared 
to the observational errors in our measurements. 

While the RPT model has been validated against simulations around the BAO scale, its 
validity on the smallest scales we consider in our analysis, r ~ 20 h~~^ Mpc, has not yet been 
properly tested. We therefore limit its use to describing the damping of the baryonic peak at 
large scales. To model the non-linear evolution of the non-BAO part of t he power spectrum 
amplitude on small scales we use the halof it code (jSmith et al. II2003I ). As a result, our 
adopted non-linear matter power spectrum model is given by 



+PRF,nw{k; a) , 



(3) 



where the "no-wiggles" Pl^uw and PHF,nw are the linear power spectrum and non-linear 
halof it power spectrum wi th the acoustic oscillat ions edited out using the featureless 
transfer function derived in (lEisenstein fc Hu I Il998l ) . Given the measurement errors for 
the current cluster sample and the uncertainties in the photo-z errors ( §3.21) . we find that 
the theoretical uncertainties in modeling the non-linear evolution of the baryonic peak — as 
reflected in the different approaches in the recent literature — are negligible by comparison. 
For example, we find that usin g the alternative prescrip tion for the non-linear matter power 



20051 ) does not change the significance of 



spectrum P^i^k) followed by (lEisenstein et al. 
the BAO feature in this work. 

The dark matter correlation function is obtained in the usual way via the Fourier trans 
form of the non-linear matter power spectrum, 

1 /" „ , sin kr 



PNL{k)- 



-k'^dr 



(4) 



27J-2 J J 

For the ACDM model parameters given in §2.21 the predicted non-linear correlation function 
is shown as the black dashed curve in Fig. [3] below. To connect this to observations, we will 
assume, for simplicity, that the correlation function for a given cluster sample is related to 
the matter correlation function by a constant bias factor. 



(5) 
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A more accurate description of cluster bias is possible, but again the uncertainties in the 
bias prescription are expected to be small compared to the current measurement errors. We 
treat the bias factor 6 as a fit parameter in comparing the model to observations; in §5.31 we 
compare the fit values of the bias with predictions from the Halo Model. 



3.2. Effects of photometric redshift errors 

As noted in §2.11 the estimated photometric redshifts (photo-z's) for the MaxBCG 
clusters have a non-negligible uncertainty, — 0.01. This translates into a positional 
uncertainty along the line-of-sight of approximately 30 Mpc, depending on cosmological 
parameters. The effect on the three-dimensional correlation function is a smearing of the 
acoustic peak and a relative damping of power on small scales. We consider and compare 
two prescriptions to model this effect, one analytic, the other based on a direct geometric 
approach. For simplicity, we ignore the redshift dependence of cr^ in this analysis. We note 
that the effect of photo-z errors is analogous to but simpler to model than redshift-space 
distortions in spectroscopic surveys. 



3.2.1. Analytic power spectrum smearing 
In the first approa ch, we adopt th e simp le analytic prescription for power spectrum 



smearing introduced in iBlake fc Bridle I (120051 ). Assuming a Gaussian smearing along the 



line of sight due to photo-2; errors, the damping of the power spectrum in the plane-parallel 
approximation is given by 

Pc,.(fc±,A;,)=P,(fc)e-'=''^' , (6) 

where and kz are the components of the wavevector k perpendicular and parallel to the 
line of sight, Pc{k) = b'^P^L^k) is the true (real-space) non-linear cluster power spectrum, 
and a is the dispersion in comoving distance along the line of sight, which is related to the 
photometric redshift error by 

In this expression, we evaluate the Hubble parameter at the effective median sample redshift 
z = 0.22. Although the photo-z correction to the power spectrum is clearly anisotropic in 
Fourier space, one can retrieve the monopole part of the observed (photo- 2; space) power 
spectrum as 



Pc{k) = - J dcos9 Pc^^{k sin 9, k cost 
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2ak 



(8) 



where 6 is the angle between k and the hne of sight (see also (IHuetsi 1120071 )). The monopole 
of the measured (photo- 2;-space) correlation function can then be computed as the Fourier 
transform of Pc{k). 



3.2.2. Geometric smearing 

While convenient, the analytic approach above does not take into account light-cone 
effects or the effects of the survey geometry. Here we directly model the geometric effects of 
photo-z errors on the measurement of the correlation function. For simplicity, we consider a 
random Poisson distribution of points covering the survey volume, using the same random 
catalog that is used in the two-point function estimator ( §2.21) . We displace each point along 
the line of sight by a random distance drawn from a Gaussian probability distribution, 

(9) 

where 2ph is the simulated photometric redshift estimate for a point at true redshift z, and 
az is the standard deviation of the photo- 2; estimate. For each pair with true separation r 
in the random catalog, we obtain a "measured" separation f after the displacements due to 
photo- 2; errors. We then calculate the conditional probability P{f\r) for a pair to have a 
measured separation f after displacement, given the true separation r, or, equivalently the 
probability P{Ar\r) for the difference Ar = f — r, given the value of r. In Fig. [T]we plot the 
normalized probability distributions for Ar corresponding to four representative values of r. 



-P(2;ph) ~ exp 



(^^ph 



2a2 
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Fig. 1. — Normalized, conditional probability distributions P{Ar \ r) of the difference Ar = 
r — r, given r = 50, 100, 150, and 200 Mpc, as measured from the random catalogs, 
assuming Gaussian photo- 2; errors with cr^ = 0.01. The dashed blue lines correspond to the 
exponential fit of eq. (ITOl) . 
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By definition, Ar is limited by Ar > — r, and we therefore expect P{Ar\r) to be 
asymmetric in Ar. As Fig. [1] suggests, P{Ar\r) is well-described by an asymmetric Laplace 
(or exponential) distribution, 

P{Ar\r) = C{r) x 




for Ar > 
for Ar < 

which is continuous at Ar = 0, and where the normalization factor is 

C{r) ^ 



(10) 



(T_i_(r) + cr„(r)[l — exp(— r/(T_(r))] 



We fit the functional form in Eqn. ffTOj) to the probability distributions measured for values 
of r ranging from 10 to 250/i~^Mpc in steps of 10/i~^Mpc. We then fit the function a'+(r) 
to an exponential functional form with three parameters. 



a+(r)=a+,o(l + c+e-^'/'+), 
while for o"_(r), imposing the condition o"_(0) = 0, we consider the two-parameter form 



(12) 



(j_ r 



(r)=a_,o(l-e->'-) 



(13) 



The resulting parameter values, for four different values of the photo-z dispersion, cr^ = 0.005, 
0.007, 0.01 and 0.02, are given in Table[2l These functions, together with the values measured 
from the realizations, are shown in Fig. [2l Note that the probability distribution is never 
symmetric, as the two functions cr+(r) and o"„(r) differ significantly at all relevant scales. 
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Table 2: Values of the parameters determined by fitting the exponential expressions for cr±{r) 
to the dispersions measured from the Poisson realizations corresponding to cr^ = 0.005, 0.007, 
0.01 and 0.02. 







(r) 




(T+(r) 








r_ 




r+ 


c+ 


0.005 


7.20 


22.9 


7.80 


30.9 


0.596 


0.007 


9.93 


31.1 


11.2 


24.6 


0.840 


0.010 


15.1 


49.4 


17.8 


30.5 


0.913 


0.020 


29.4 


100.0 


36.1 


25.0 


2.318 
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Fig. 2. — Values of o"+(r) and o"_(r) measured from the Poisson realizations and correspond- 
ing fitting functions from Eqns. (fT2|) and f|T3l) as determined from the measured probability 
distributions P{Ar\r) for a Gaussian photometric-redshift error of = 0.005, 0.007, 0.01 
and 0.02. 
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Using this procedure, we obtain an analytic expression for the probabihty function 
P{Ar\r). The measured (photo- 2;-space) cluster correlation function, C,cc{f^), can then be 
approximately derived from the real-space correlation function ^{r) by a normalized convo- 
lution. 



rUr)P{r\ 


\r)RR{r)dr 




\r)RR{r)dr 



(14) 



where P{f\r) = P(Ar|r), and RR{r) is the total number of pairs in the random catalog 
at the actual separation r. We find that the separation dependence of this quantity in the 
random catalog can be fit over the relevant range of scales by 




where the separation r is in units of h ^ Mpc. 
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Fig. 3. — Predicted matter two-point correlation function, including correction for photo-z 
errors. Dashed black curve shows the non-linear matter correlation function (Eqn. Hj) for 
the ACDM model, while the dotted black curve shows the linear theory correlation function. 
Long-dashed blue and solid red curves show the non-linear correlation function in photo-z 
space for Gaussian photo-z errors with = 0.005, 0.007, 0.01, and 0.02. Long-dashed blue 
curves use the analytic plane-parallel approximation of Eqn. ([8]), solid red curves use the 
geometric convolution of Eqn. ( fT4l) . In all cases, the BAO feature is severely smoothed by 
the photo- 2; errors. Corresponding curves in the lower parts of each panel show the relative 
differences with respect to the linear theory prediction; the spike around r ^ 130 Mpc 
corresponds to the scale at which the linear correlation function vanishes. 
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3.2.3. Comparison 

In Fig. [3] we compare these two methods of estimating the impact of photo- 2; errors on 
the two-point correlation function, for Gaussian photo- 2; errors with dispersion = 0.005, 
0.007, 0.01, and 0.02. The continuous red curves show the photo- z-space correlation func- 
tions calculated using the geometric convolution in Eqn. fll4p : the long-dashed, blue curves 
show the corresponding results using the Fourier transform of the analytic approximation of 
Eqn. ([8]). On small scales, r < 30 Mpc, photo-2 errors strongly suppress the correlation 
function amplitude and flatten its slope. On larger scales, as the lower left panel of Fig. [3] 
shows, the combination of non-linearity and photo-2; error transforms the expected BAO 
bump into a more subtle inflection feature in the two-point correlation function. As a result, 
when statistical errors are included, we do not expect to be able to detect the BAO signature 
with high significance in this data set. 

The geometric and analytic results are in excellent agreement with each other for small 
values of the photo-z dispersion, but a difference is observed for cr^ > 0.01. In the data 
analysis and in the rest of the paper we adopt the geometric model and assume a constant 
error Cz = 0.01 for our fiducial results, except where otherwise noted. In this case, the 
photo- 2; error changes the amplitude of the non-linear correlation function — in going from 
real to photo-2 space — by as much as 170% on scales r ~ 90 Mpc. However, we note 
that even a small uncertainty in the photo-2 dispersion o"^ introduces a significant systematic 
uncertainty in the mapping of the correlation function amplitude from real to photo-z space, 
due to the difference in the smoothing of the acoustic features. For some of our results below, 
we therefore estimate a systematic error in the inferred real-space correlation function by 
comparing results from the geometric model for = 0.01 with those for = 0.007. At the 
same time, we note that a sufficiently large spectroscopic sample can in principle reduce the 
uncertainty in to a small level. We plan to carry out a more detailed study o f this and other 



issues in the modeling of the correlation function in photo-2 space elsewhere ( lEstrada et a.l 



20081). 



4. The covariance matrix 

In order to compare the model predictions to the data and extract parameter measure- 
ments, we must have an estimate of the error covariance for the correlation function. There 
are two common procedures for estimating the errors. The first uses a jackknife estimator 
by creating subsamples from the data; this has the advantage of being independent of model 
assumptions, but it may not properly account for the variance due to modes on scales larger 
than those spanned by the survey. The second estimates the errors from a model, either 
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using the variance among a large number of survey mock catalogs or, on large scales, us- 
ing an analytic estimate assuming Gaussian perturbations. The model errors in principle 
account for cosmic variance due to modes on arbitrarily large scales, but they assume the 
model provides an accurate representation of the data. In the data-model comparison of §3 
we present fit results based on both the jackknife and an analytic prediction; in this section, 
we compute these error estimates and compare them. 

For the jackknife error estimate, we produced 1000 subsamples of each of the four 
MaxBCG samples, in each case with 1/lOOOth of the clusters removed at random. This 
procedure corresponds to the "traditional" jackknife approach in statistics, but it differs from 
the standard jackknife practice in large-scale structure studies, in which entire subvolumes 
are removed at random. The correlation function is measured for each subsample, and the 
covariance matrix is estimated from 

covte,e,) = - - 0) , (16) 

1=1 

where = ^'(rj) is the correlation amplitude in the ith separation bin, r,, for subsample 
/, and N = 1000 is the number of subsamples. In Fig. H] we show the diagonal jackknife 
standard deviation, cT^(rj) = Cov{^i, ^j), for the four samples (black points). As a test we 
have also performed jackknife measurements by removing subvolumes. When the number 
of removed subvolumes is small, the jackknife measurements are noisy; when it is large, the 
results appear to converge to those from the "removal of clusters" procedure above. 

In the alternative approach to error estimation, the predicted error covariance assuming 
Gaussian perturbations is given by 



cov(e.,e.) ^ V X 

sin(/crj) sin (/cr^) 



(17) 



Here V is the survey volume and 



PUk)=b^Pj,L{k) + -, (18) 

where b is the linear bias parameter, PNiik) is the non-linear matter power spectrum com- 
puted as described in the previous section, and the second term in Eqn. (fTSj) accounts for the 
shot-noise correction. For the term involving l/n1 in Eqn. (flTll we make use of the integral 



/ d/c sin (/crj) sin (/crj) = -^^^oiri — Tj) 



TT 



2 Ar 



S,, , (19) 



where Ar = 5h ^ Mpc is the bin-size for the correlat i on fu nction measurement. For a 
detailed discussion of these shot-noise effects, see ICohn I ( 120061 ) . 



Fig. m shows the analytic prediction for correlation function standard deviations includ- 
ing the shot-noise correction (red, continuous curves), using the ACDM power spectrum 
and the best-fit values for the bias parameters from the analysis in ^ (note that those bias 
parameter estimates are derived using the jackknife covariance matrix). The predictions are 
consistent with the jackknife estimates at the few to 30% level for all samples. 
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Fig. 4. — Standard deviations cr^{r.i) = ^yCov{^i, ^j) of the correlation functions ^{r) for 
the four MaxBCG samples, estimated using the jackknife technique (black points) and as 
predicted by linear perturbation theory including the shot-noise contribution in Eqn. ( |T7l) 
(solid red). The linear theory prediction assumes the ACDM model parameters given in §2.21 
and values of the linear bias parameter derived in ^ 



- 21 - 



The expression in Eqn. f fT7|) is the analytic estimate for the covariance of the correlation 
function in real space. To estimate the covariance for ^ in photo- 2; space, a naive approach 
would be to replace the expression for Ptot{k) in Eqn. fll8l) with a photo- z-corrected expression 
analogous to that in Eqn. ([H]), i.e., 



PUk) = b'PNL{k)f^eTf{ak) + - . (20) 

This would result in a predicted covariance generally lower than that in real space and more 
discrepant with the jackknife estimates, about 50% lower than the latter for all samples. 
Such an expression would account for the smoothing induced by photometric errors along 
the line of sight, but it does not include the extra component due to the intrinsic randomness 
of photo- 2: displacements. We postpone a more detail ed discussion of the c ovariance of the 



correlation function in photo- 2; space to future work (lEstrada et a.l II2008I ): here, we limit 
ourselves to the analytic estimate of Eqn. (|T7|1 . This expression does not take into account 
contributions due to the non-Gaussianity generated by gravitational instability and due to 
the anisotropic geometry of the survey. In these respects, it should provide a lower bound 
for the actual correlation function covariance. 
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Fig. 5. — Cross-correlation coefficients for the binned correlation function for the ^ 
10 sample determined from the jackknife technique (top) and from the Gaussian analytic 
prediction in real-space (bottom) including the shot-noise contribution. The horizontal axes 
show the linear bin separation label i, where i = 1 corresponds to = 5 Mpc and z = 39 
to Ti = 195 /i"^ Mpc. 
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In Fig. [5] we show the cross-correlation coefficients, defined by 



(21) 



ij — 



v/covte,6)cov(e„e,)' 



for the N200 > 10 sample. We present results for both the jackknife estimator (top panel) 
and the Gaussian analytic prediction in real space (bottom panel) including the shot-noise 
contribution. The off-diagonal elements differ significantly between these two approaches, 
and it appears that the jackknife may underestimate them. Underestimating the off-diagonal 
elements could lead one to assign greater significance to features in the correlation function 
such as the acoustic peak, since it corresponds to underestimating the covariance between 
different separation bins. In other words, when comparing two models to the data, say, one 
with the BAO feature and the other without, if the jackknife and linear theory diagonal 
covariance elements are similar (as in, e.g., the A^2oo ^ 10 case in Fig. H]), then the jackknife 
error estimate will yield a larger value for the difference between the two models compared 
to the linear theory error estimate. 



In this section we present the correlation function measured for the four cluster samples 
introduced in § [2], using the estimator of Eqn.([T]). We first present the measurement on 
scales r < 60 Mpc and present fits for the correlation length and power-law slope of the 
two-point function, corrected for photo-z errors. We then consider the correlation function 
on larger scales and compare to the model presented in § |3l determining the best-fit values 
for the cluster linear bias parameter and the evidence for baryonic features. 



5. 



Results 



5.1. Estimate of the correlation length 
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Fig. 6. — Data points show the cluster correlation function in photo- 2; space measured for 
the four richness samples in bins of width Ar = 5/i~^Mpc, with jackknife errors. Dotted 
red curves show inferred power-law $,cc{r) in real space, assuming cx^ = 0.01. Solid red curves 
show best-fit power-law models convolved with the photo-z error distribution, C,cc{r), which 
should match the data. Dashed blue curves show power-law fits to the data assuming no 
photo- 2; error correction, = 0. 
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Historically, measurements of the cluster correlati on function found results consistent 



with a power law over scales r < 60 


h ^ Mpc or so ( 


Bahcall et al. 


1983 


1992; 


Croft et a.l 


1997; 


Nichol et a.l 


1992; 


Gonzalez et a.l 


2002 





r 

Rq 



(22) 



where the correlation length Rq depends on cluster richness, and the slope is 7 ~ 1.8. 
A compilation of these results, along with measurements from an earlier, much smaller 
ve rsion of the MaxBCG catalog (from the SDSS Early Data Release, EDR), can be found 



m 



Bahcall et al. I (120031 ). 



In Fig. [6], we show the estimated correlation functions for the four maxBCG samples 
over the range 20 — 60 Mpc. To fit these results to the power-law form of Eqn. (!22l) . we 
must include the effects of photometric redshift errors. We do this by inserting Eqn. (1221) 
into Eqn. (1141) and comparing with the data, assuming a photo-2; scatter of cr^ = 0.01. The 
resulting best-fit correlation function in observable space, ^cd'f^), and the inferred power-law 
correlation function in real space, C,cc{r), are shown as the solid and dotted red curves in 
Fig. [6l In Table [3l we present the inferred correlation length and slope in real space for 
the four MaxBCG samples. The statistical errors on Rq and 7 come from the jackknife 
covariance. However, since there is some uncertainty in the photo-^ error variance, in the 
Table we show results for both cr^ = 0.01 and 0.007; the difference between them provides 
an estimate for the systematic errors in the inferred parameters. We also show the inferred 
parameter values in the case that photo- 2; errors are completely ignored, = 0; these 
correspond to the power-law fit parameters in photo-z space. 
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Table 3: Power-law fits to the cluster correlation function on scales r = 20 — 60 Mpc, 
for three values of the photo-2; error variance, az = 0.01, 0.007, and 0. The inferred real- 
space correlation length Rq is given in Mpc, and 7 is the inferred slope of the real-space 
correlation function. The fourth column gives the P^r degree of freedom for the best fit, 
while the fifth column shows the mean cluster separation d, in Mpc, assuming a sample 



volume of 0.5 h ^ Gpc'^. 








sample 


Rq 


7 


xVd-o.f. 


d 


= 0.01 


iV200 > 10 


15.93 ±0.33 


2.21 ±0.11 


0.45 


33.1 


N200 > 11 


16.45 ±0.38 


2.18 ±0.12 


0.36 


35.4 


N200 > 13 


18.14 ±0.43 


2.24 ±0.15 


0.33 


40.0 


N200 > 16 


19.33 ±0.48 


2.56 ±0.23 


0.48 


46.9 


= 0.007 


N200 > 10 


14.80 ± 0.46 


1.98 ±0.10 


0.42 


33.1 


N200 > 11 


15.27 ±0.52 


1.95±0.11 


0.35 


35.4 


iV2oo > 13 


17.14 ±0.56 


2.00 ±0.13 


0.31 


40.0 


N200 > 16 


19.18 ±0.58 


2.26 ±0.17 


0.47 


46.9 


a, = 


A^2oo > 10 


14.42 ± 0.73 


1.84 ±0.11 


0.43 


33.1 


N200 > 11 


14.81 ±0.81 


1.80 ±0.12 


0.39 


35.4 


N200 > 13 


16.94 ±0.92 


1.85 ±0.14 


0.30 


40.0 


N200 > 16 


20.10 ±0.92 


2.13 ±0.19 


0.49 


46.9 
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Fig. 7. — Correlation length Rq vs. mean cluster separation d for different cluster samples. 
Results for the MaxBCG catalog (this work) are shown as red diamonds. Vertical error 
bars include the systematic uncertainty in Rq due to photo-z error uncertainty as well as 
the marginalized error from the analysis; horizontal error bars correspond to an assumed 
20% systematic uncertainty in the maxBCG sample volume. Blac k squares are from the 
compilation of earlier cluster measurements of iBahcall et al. I (120031 ). in which a power-law 
slope 7 = 2 has been assu med. The dashed curv e is the power-law fit of Eqn. (l23l) to the 



prediction of ACDM, from I Younger et al. I (120051 ). 
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Models of structure formation predict that the correlation length -Rn should scale with 
the c l uster mean s eparation d, where 1 / = fir. (ISzalav fc Schramm Ill985l : iGovernato et al. 



19991 : iLake I Il999l : IColberg et al. I l2000l ) . lYounger et al. I (120051 ) studied this scaling using 



N-body simulations of the ACDM model and found that the results are well represented by 
a power-law relation over the separation range 20 Mpc < d < 60 Mpc, 



Ro = 1.7 



d 



Mpc 



0.6 



(23) 



for (Tg = 0.84. In Fig. [7] we plot Rq vs. d for the MaxBCG samp l es (re d diamonds) and for 
the cluster measurement compilation presented in lBahcall et al. 1 J2003h (black squares) and 
compare those with the power-law A CDM relation of Eqn . (l23l) (dashed curve). We have 



not plotted the SDSS EDR results of iBahcall et al. I (120031 ) . since they come from a subset 



of the current data and are therefore not independent of the new results we show here. 

As Fig. [7] shows, the correlation function results given here are broadly consistent with 
those from previous cluster samples, including the scaling of the correlation length with 
richness or mean separation. However, the correlation amplitude is slightly higher than the 
ACDM prediction of Eqn. ([23D, by about 14 (8) % for a, = 0.01 (0.007); the cluster bias is 
higher by the same factor. To decide whether this difference is significant will require more 
precise modeling of the maxBCG photo- 2; error distribution as a function of richness and 
redshift. In addition, Eqn. ( l23l) has been derived for mass-selected catalogs, and it therefore 
uses a different selection function from the observations. 



As noted in §3.2.31 we can compare the results for a photo- 2; dispersion of cr^ = 0.01 with 
those for = 0.007 to estimate the systematic error due to the uncertainty on the dispersion 
az- For the > 10 sample, the resulting systematic error on the correlation length is 
ARo = 1.13, about three times larger than the statistical error; for the correlation slope 
the systematic error is A7 = 0.22, about twice the statistic error. Including this systematic 
largely eliminates the discrepancy between the data and the model of Eqn. (123|) . as Fig. [7] 
shows. As noted in §3.2.31 this systematic can be reduced by more careful modeling of cxz 
and its redshift dependence. 



5.2. Correlation function on large scales 

Here we consider the cluster correlation function measured over a larger range of scales, 
from 20 to 195 /i"^ Mpc, and compare the results with the theoretical model of ^ We 
describe the model correlation function for each MaxBCG sample by a two-parameter model. 



^cc{r; s,b) = 6^ ^rrmi{r * s) , 



(24) 
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where f is the separation in photo-z space, C,mm is the matter correlation function of Eqn. 
dl]), corrected for photometric redshift errors as described by Eqn. f|T^ . assuming cr^ = 0.01, 
and the parameter s is described below. For comparison, we also show results for the model 
fits using the analytic correction of Eqn. ([H]) for the photo-z errors. Note that, in fitting for 
the cluster linear bias parameter b, we fix the linear matter power spectrum amplitude ag to 
its fiducial value of 0.9; in actuality, the fit constrains the product b{as/0.9). 

In this model, we have assumed that small variations in the cosmological parameters 
can be incorporated simply as changes in the predicted physical separation r, i.e., by the 
scale shift parameter s defined by 



s = 





1/3 


r Hiz-p*) 1 


1/3 


[ H{z;p) \ 




[dI{z-p*)\ 





(25) 



Here, z is the median survey redshift, H{z;p) is the Hubble parameter, Dm{z;p) = [c/(l + 
z') is the comoying angular diameter dis t ance, and jo represen t s the cosmological 



z)]f,'dz'/H 
parameters ( 



2007; Huetsi 



Seo et al II2003I : lEisenstein et al. II 20051 : lAngulo et al. II2008I : ISeo fc Eisenstein 



200 



3); p* represents the fiducial set of cosmological parameters enumerated at 



the end of §2.2[ The scale shift parameter involves a geometric average of the two transverse 
components and the line-of-sight component and therefore applies to spherically averaged 
separations r. Although the s parameter does not capture the full cosmological parameter 
dependence of the correlation function, it does describe the effects of small cosmological 
parameter variation on the location of the baryon acoustic peak to the accuracy we need 
( iBlake fc Glazebrook 1 12003| ). It is therefore a convenient model parametrization for the 
purpose of determining the significance of the BAO feature. 

To assess the significance of a possible BAO feature, we also compare the data to a 
model with no BA O feature in the l i near p ower spectrum, computed using the smooth 



transfer function of lEisenstein fc Hu I (119981 ). In the no-BAO case, the uncertainties in 
the measurements of the cluster correlation functions do not allow a meaningful constraint 
on the shift parameter s: with a flat prior on it, s tends to unphysically small values, 
particularly if we restrict the analysis to relatively large scales (r > 60/i~^Mpc). In the 
computation of the statistic, we therefore introduce, for both the BAO and no-BAO 
cases, a Gaussian prior on the parameter s, with central value s = 1 (corresponding to our 
fiducial cosmology) and standard deviation of 0.05, consistent with the current uncertainties 
i n the relevant cosrn ological parameters Vtmh'^ and VLi,h? from WMAP CMB observations 
(ISpergel et al. 1120071 ). Since we are not attempting to constrain cosmological parameters 
with this measurement but only gauging the significance of a possible BAO feature, this 
prior on s simply allows us to make a sensible comparison between the BAO and no-BAO 
models. 
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r [h-' Mpc] r [/]"■ Mpc] 

Fig. 8. — Measured correlation functions for the four MaxBCG samples (points) in 36 linear 
separation bins from r = 20 to 195 Mpc. Best-fit ACDM models from the two-parameter 
fits with BAO features (solid red curves) and without acoustic features (dashed red curves). 
Error bars on the data points are estimated using the jackknife, while the shaded green areas 
show the linear theory (Gaussian) predictions for the errors. Insets show close-ups of the 
region around the expected BAO feature. 
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Fig. 9. — Same as fig. [8] but showing ^(r)r^ to emphasize the large-scale features. 



- 32 - 



5.2.1. Results 

Figs. M and [9] show the measured cluster correlation functions together with the best-fit 
models from the two-parameter analysis both with and without acoustic features and using 
the geometric correction for photo- 2; errors. The error bars on the data points correspond 
to one standard deviation from the diagonal of the jackknife covariance matrix; since the 
covariance matrix is non-diagonal, the points in different separation bins are correlated. 
For each sample, the green shaded region shows the Gaussian prediction for the standard 
deviation, plotted around the best-fit model. 

In Table S] we present the best-fit values for the two parameters b and s for the four 
MaxBCG samples and the corresponding values. The fits are based on measurements in 36 
linear bins ranging in separation f from 20 to 195 Mpc, resulting in 34 degrees of freedom. 
For the upper rows of the Table, the correlation function covariance is determined from the 
jackknife method; the lower rows show results using the linear perturbation theory covariance 
matrix, Eqn. f[T7|) . Note that the linear theory covariance estimate requires knowledge of the 
linear bias factor b, which is one of the parameters we are aiming to extract from the data. 
We therefore first estimate the linear covariance using the values of b from the jackknife 
error fits; the resulting linear theory covariance estimates are then used to recompute the 
values and extract new estimates of b and s. 
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Table 4: Best-fit values for the parameters b, s from .^cc(^) measurements for the four 
MaxBCG samples, using the covariance from the jackknife method (upper rows) and from 
the Gaussian analytic prediction (lower rows), with the error on b marginalized over s. Last 
column displays difference between the best BAO and no-BAO fits. The fits use the 
geometric correction for photo-2; errors with = 0.01 and measurements over the range 
20 < f < 195 Mpc. The value corresponds to 34 degrees of freedom. 



sample 


BAO fits 


no-BAO fits 






iV200 > 


b s 


b 


s 






Covariance matrix from jackknife 


10 


2.80 ±0.13 0.96 24.6 


2.79 ±0.14 


0.96 


26.6 


2.0 


11 


2.91 ±0.15 0.97 22.3 


2.90 ±0.16 


0.98 


23.8 


1.5 


13 


3.26 ±0.20 0.97 18.0 


3.25 ±0.20 


0.98 


18.9 


0.9 


16 


3.76 ±0.24 1.02 19.2 


3.74 ±0.25 


1.03 


20.5 


1.3 


Covariance matrix from linear theory 


10 


2.86 ±0.11 0.98 93.1 


2.86 ±0.13 


1.00 


95.5 


2.4 


11 


2.95 ±0.12 0.97 97.9 


2.95 ±0.13 


0.99 


100.2 


2.3 


13 


3.35 ±0.14 1.00 77.9 


3.37 ±0.16 


1.02 


79.7 


1.8 


16 


3.96 ±0.19 1.04 76.5 


3.98 ±0.20 


1.07 


78.4 


1.9 
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Using the jackknife error estimates, the A''2oo > 10 sample shows a difference in be- 
tween the BAO and no-BAO models of Ax^ = 2; thi s corresponds to a marginal significance 



of 1.4(7 for the BAO feature. Using the same sample, iHuetsi I (120071 ) found a somewhat larger 
significance of about 2a for the BAO feature from a power spectrum analysis. Using the 
jackknife error and computing the theoretically expected difference for this sample between 
the BAO and no-BAO models, the typical expected difference corresponds to only la. That 
is, given the photo-z errors and the sample size, we would not expect a priori to find a 
highly significant BAO detection from this sample, as noted in §3.2.3[ The significance of 
the BAO feature generally goes down as the cluster richness threshold is increased, reflecting 
the larger Poisson errors for these smaller samples. The values for the bias parameter trend 
upward with increasing cluster richness, as expected on theoretical grounds — more massive 
clusters are more strongly clustered (see §5.31) — and consistent with the results on smaller 
scales ( §5.ip . 

If we drop the 5% Gaussian prior on the shift parameter s, the correlation function data 
of the A^2oo > 10 sample constrains it to the la range s = 0.92 ± 0.08 for the BAO model 
fit and the jackknife covaria nce matrix. The 8% erro r on s, roughly twice that for the SDSS 



spectroscopic LRG sample (lEisenstein et al. II 20051 ). does not yield a significant constraint 
on cosmological parameters. 

The same analysis carried out using the Gaussian (linear theory) error covariance yields 
significantly larger values for the statistic in all samples, a sign that this method possibly 
underestimates the errors. However, the differences in between the BAO and no-BAO 
models are comparable to though slightly larger than those for the jackknife errors. In the 
N200 > 10 case, for instance, Ax^ = 2.4, corresponding to a significance of l.Scr. 



5.2.2. Analytic photo-z error correction 
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Table 5: Same as Table H] but using the analytic photo- 2; error correction of Eqn. (jS]). 



sample 


BAG fits 


no-BAO fits 






iV200 > 


b s 


h 


s 


x' 




Covariance matrix from jackknife 


10 


2.84 ±0.14 0.94 24.3 


2.82 ±0.15 


0.95 


26.7 


2.4 


11 


2.96 ±0.16 0.96 21.9 


2.94 ±0.17 


0.97 


23.6 


1.7 


13 


3.32 ±0.20 0.97 17.6 


3.30 ±0.21 


0.98 


18.7 


1.1 


16 


3.82 ±0.25 1.01 19.0 


3.82 ±0.27 


1.02 


20.5 


1.5 


Covariance matrix from linear theory 


10 


2.89 ±0.12 0.96 91.7 


2.88 ±0.14 


0.98 


94.7 


3.0 


11 


2.98 ±0.13 0.96 96.4 


2.98 ±0.14 


0.98 


99.2 


2.8 


13 


3.38 ±0.15 0.98 76.7 


3.38 ±0.17 


1.00 


79.0 


2.3 


16 


3.99 ±0.20 1.02 75.6 


4.00 ±0.22 


1.04 


78.1 


2.5 
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Fig. 10. — Best-fit models to r'^^{r) for tlie > 10 sample, assuming different errors on the 
photometric redshift determination: cr^ = (no correction, dotted black curve), cr^ = 0.005 
(short-dashed blue curve), cr^ = 0.007 (long-dashed magenta curve), = 0.01 (continuous 
red curve), and = 0.02 (dot-dashed green curve), using the geometric photo- 2; correction. 
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To gauge the robustness of these resuhs, in Table [5] we present the model fits using the 
analytic photo- 2; correction of Eqn. ([8]) in place of the geometric correction. The best-fit 
values of the parameters as well as the absolute values and differences in are close to and 
consistent with those presented above. The values of the bias parameters and the significance 
of the BAO feature are both slightly higher in this case. This is traceable to the fact that 
the analytic photo-^ correction leads to slightly less smoothing of the correlation function 
than the geometric correction for = 0.01, as can be seen from Fig. [31 On one hand, 
reduced smoothing decreases the correlation amplitude at small scales, where the error bars 
are smaller. On the other hand, a more pronounced (less smoothed) acoustic peak better 
fits the excess of power at scales of about llO/i^^Mpc that is clearly visible in the data in 
Fig.[9l For the > 10 sample, the significance of the BAO feature here is about 1. 5 - 1.7a, 



depen ding on the covariance estimate (jackknife or linear), closer to the result of (IHuetsi 



20071 ). which used the same analytic photo- 2; error correction to the power spectrum. 



5.2.3. Dependence on photometric redshift error 
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Table 6: Best-fit values for tlie 2-parameter analysis of the > 10 sample assuming 

different values for the photometric redshift error o"^. The middle rows correspond to the 
geometric correction, the lower ones to the analytic method. Here we assume jackknife 
covariance, and there are 34 d.o.f. for the analysis. 





BAG fits 
h s \^ 


no-BAG fits 


Ax' 


No photo-2 correction 





2.30 0.90 39.6 


2.17 0.86 43.6 


4.0 


Geometric photo-2 correction 


0.005 
0.007 
0.01 
0.02 


2.27 0.88 31.8 
2.44 0.90 27.8 
2.81 0.96 24.6 
4.20 1.14 36.1 


2.19 0.86 33.8 
2.39 0.89 29.6 
2.79 0.96 26.6 
4.19 1.15 36.2 


2.0 
1.8 
2.0 
0.1 



Analytic photo- 2; correction 



0.005 


2.28 


0.88 


31.7 


2.22 


0.86 


33.2 


1.5 


0.007 


2.51 


0.91 


27.6 


2.45 


0.90 


29.5 


1.9 


0.01 


2.84 


0.94 


24.3 


2.82 


0.95 


26.7 


2.4 


0.02 


3.85 


1.02 


25.0 


3.88 


1.04 


26.3 


1.3 
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Here we assess the impact of uncertainties in the photometric redshift errors on the 
results. We carry out the analysis of the A''2oo > 10 sample using different values of the 
photo- 2; dispersion a^. In Table E] we present the best-fit parameter and values for cr^ = 0, 
0.005, 0.01, and 0.02 for both the geometric and analytic photo-z error corrections. The 
significance of the BAO feature (Ax^) is not strongly dependent on the photo- 2; dispersion, 
except for the case Oz = 0.02, which is twice as large as estimated from the maxBCG catalog. 
Fig. [To] shows the best-fit models of table[6]using the geometric photo-z correction, compared 
to the data for the iV2oo > 10 sample. 

As we discussed in §5.11 we can use the difference in results for cr^ = 0.01 and = 
0.007 to derive an approximate systematic error estimate for the bias parameter due to the 
uncertainty in a^- For the A^2oo ^ 10 sample, a difference of Acr^ = 0.003 corresponds 
to a difference of Ab = 0.37. As we noted for the correlation length in §5.H this error is 
significantly larger than the statistical error of about 0.14 for this sample. However, this 
systematic error estimate is quite conservative. Moreover, any redshift dependence of can 
be included in the geometric method and should not be counted as a systematic error. We 
postpone a more detailed discussion of this point to a future paper. 

5.2.4- The integral constraint 

So far we have not modeled the impact of the integral constraint, which arises from 
the fact that the integral of the two-point function over the survey volume is assumed to be 
zero when estimating the sample density in Eqn. ([1]). One can model this by including an 
additive constant c in the correlation function, 

(26) 

We find that the improvement to the model by treating this term as an extra free parame- 
ter is very small. For instance, for the iV2oo > 10 sample, the analysis of this three-parameter 
model gives a marginalized value of c = 0.0002±0.0003, consistent with zero, while it changes 
the best-fit values for h and s by amounts much smaller than their statistical errors. Com- 
pared to the two-parameter model above, the value of the best-fit decreases by only 0.4 
(for 34 d.o.f.) for both the BAO and no-BAO models. 



sobs 



V; s) + 



-40- 



0.5 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 

_i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i_ 


S-pnrnmptf>r 
3-parameter 

Nj,„ £10. r£20 /j-i Mpc 
68% CL 


- / \ - 
{ 1 
/ / 
i / 
I / 

1 / 

; I / \ 

^ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


_ 1 1 1 1 ' 1 1 _ 

/ \ - 

/ 

- 1 \ - 
/ J - 

; / ■ 
/ / - 
' / 
' / 
1 / 

" , 1 , 1 . 1 , " 



2.2 2.4 2.6 2.8 0,8 0.0 1 

b 8 



Fig. 11. — 1-cr contour plots for b, s, and c x 10^ (dashed, blue curves) marginalized over 
the third parameter, for the > 10 sample.. Upper left panel also shows results of a 
2-parameter analysis {b and s: red, continuous line) for comparison. 
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To illustrate these points, in the upper left panel of Fig. [TT] we show the 1-a contour 
for the 2-parameter analysis (red, continuous line) compared to the 3-parameter analysis 
in which we marginalize over c (blue, dashed lines) for the A^2oo ^ 10 sample. In the 
latter case, we do not impose the WMAP prior on the shift parameter s, and we only 
consider the baryonic model. Other panels show contours for c versus b and s, marginalized 
over the missing parameter. The degeneracy between the bias and shift parameters is not 
significantly affected by the introduction of c. The additive constant also does not introduce 
large degeneracies with the other two parameters, and its allowed values are consistent with 
zero. 



5.2.5. Sample Purity 



According to iKoester et al. I (j2007al ). the MaxBCG cluster sample is estimated to be 
about 90% pure. As a simple test of the effects of sample purity on the correlation function, 
we replace 10% of the clusters in the A^'2oo > 10 sample with randomly distributed points. 
As expected, this lowers the correlation function amplitude by a factor of about (0.9)^, 
corresponding to a ~ 10% systematic uncertainty in the bias, with negligible change in the 
shift parameter s. We also find slightly lower values for the x^, particularly for the no-BAO 
models, when the jackknife covariance and the geometric correction for photo- 2; errors are 
used. This results in a significantly lower value for the difference between the BAO and 
the no-BAO model, Ax^ = 0.7. 



5.3. Cluster bias 



Here we compare the values obtained for the cluster bias parameters for the different 



1996: 


Mo et al. 


1996 





Sheth fc Tormenlll999l ). We also 



translate the bias-richness relation into a measurement of bias vs. halo mass, using the 
mass-richn ess relation derived fr om statistical weak lensing measurements for this cluster 
sample in (I Johnston et al. 1120071 ). 



For this analysis, we compare results using three different separation intervals: r = 
20 - 60 h-^ Mpc, 20 - 195 h'^ Mpc, and 60 - 195 h'^ Mpc. The resuhing values for the linear 
bias are given in Table [71 and the corresponding fits to the data of the iV2oo > 10 sample 
are plotted in Fig. [T2i As in the previous section, the 1 — a errors on h are determined after 
marginalizing over the shift parameter s, including the 5% Gaussian CMB prior on s. The 



ACDM model with BAO is assumed. 
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Table 7: Linear cluster bias parameter b estimated for the different samples, using differ- 
ent ranges in the separation f and the jackknife covariance matrix. The values have been 
marginalized over the shift parameter s, using a Gaussian prior with mean s = 1 and disper- 
sion (Tg = 0.05. We use the geometric photo-2; error correction with = 0.01 and assume 
(Tg = 0.9 for the mass clustering amplitude. 



sample 




range [ h ^ Mpc] 




iVaoo > 


20 < f < 195 


20 < f < 60 


60 < f < 195 


10 


2.81 ±0.13 


2.87 ±0.16 


3.21 ±0.37 


11 


2.91 ±0.15 


2.96 ±0.17 


3.35 ±0.44 


13 


3.26 ±0.19 


3.33 ±0.21 


3.40 ±0.55 


16 


3.76 ±0.24 


3.82 ±0.27 


3.46 ±0.77 



20 40 60 80 100 120 140 160 180 200 

r [h~^ Mpc] 



Fig. 12. — Two-parameter ACDM fits to the correlation function of the A^2oo ^ 10 sample, 
using different separation ranges: r = 20 — 60 Mpc (short-dashed blue), 20 — 195 Mpc 
(solid black), and 60 — 195 Mpc (long-dashed red). Here we use jackknife error covariance 
and the geometric photo-z error model with = 0.01. 
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Table [7] indicates that the bias increases with richness, as noted above. The increase 
is driven by the measurements on scales r < 60 Mpc: there is no significant trend of 
correlation amplitude with richness on larger scales, because the statistical errors are large 
there. In addition, for the iV2oo > 10 and 11 samples, the bias on scales r > 60 Mpc 
appears to be larger than that on smaller scales, as indicated by the mismatch of the model 
fits in Fig. [121 This could be an indication of scale-dependent bias or of extra large-scale 
power beyond that expected in ACDM, as also suggested by the photometric LRG analyses 
( iBlake et al. 1120071 : iPadmanabhan et al. 1120071 1. but the trend is not statistically significant 
given the errors for the current sample. 
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Fig. 13. — Cluster bias as a function of richness. The data points correspond to the values 
measured in the correlation function analysis, Table U\ with geometric correction for photo- 2; 
errors. Points correspond to different separation ranges for the measurement: r = 20 to 
195/i-iMpc (black), 20 - 60 /i"^ Mpc (blue), and 60 - igS/i'^Mpc (red); the points are 
slightly displaced from each other horizontally for clarity. The curves represent the Halo 
Model predictions, for which mass has been translated to richness by matching the cluster 
counts to the halo mass function at 2; = 0.22. Dashed curve assumes Zf = Zq, and the 
continuous curve accounts for the distribution of formation redshifts [Eqn. ( !34|) ]. Shaded 
regions around the HM curves show the errors due to 20% uncertainty in the survey volume. 
Left panel: assuming ag = 0.9 for the linear power spectrum normalization; Right panel: 
as = 0.8. Here the bias values measured from the correlation function have been rescaled by 
0.9/0.8 since the clustering amplitude is proportional to (focrg)^. 
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The Halo Model (hereafter H M) provides an analytic expression fo r the bias of halos 



as a function of halo mass, bhiM) flMo fc White Ill996l : IMo et al. Ill996l : ISheth fc Tormen 



19991 ). We can therefore express the expected value for the bias of a given cluster richness 
sample as 

1 r"^ 

6 ~ _ / dMn^iM, z) hh{M, z) , (27) 

where nh{M,z) is the mass function of dark matter halos at redshift z, and bh{M,z) is the 
halo bias function from HM. The threshold mass Mmm for a given richness sample can be 
determined by requiring that the theoretical cluster number density, 



ric 



dMnh{M,z) 



{21 



matches the observed value for the sample. The implicit simplifying assumption in this 
matching is that halo mass is a monotonic function of cluster richness, with no scatter in 
the relation between them. 

We use the Sheth & Tormen ( Sheth fc Tormen~lll999l ) formula for nh{M, z), 

p din a 



nh{M,z) 



(29) 



M2dlnM- 

where u = 5c/iy{M, z) with 5c = 1.686, cr^(M, z) is the variance in the linear density pertur- 
bation amplitude on mass scale M at redshift z, and the function 




(30) 



with A = 0.322, p = 0.3, and q = 0.707. We neglect the mild dependence of the critical 
density 6c on the value of Qrn{z), using the constant value from spherical collapse in an 
Einstein-de Sitter Universe. For our fiducial cosmology, and assuming the survey volume 
to be 0.5 Gpc^, by matching the number densities we find for our four samples the 
theoretical mass thresholds M^i„ = 8.6, 9.9, 12.5 and 16.7 x lO^^/i'^Mg, respectively for 
-^200 > 10, 11, 13 and 16. Note that these mass estimates are about a factor of two higher 
than t hose inferred directly from weak lensing measurements for the same cluster sample 
( §2.ip (jjohnston et al. 1120071 ). a point to which we return below. 

The halo bias function bh, which depends on both the redshift of obs ervation Zr, and o n 
the redshift of forrnation Zf of the halos, is given in the HM by (see, e.g., iMo et al. I (119961 ): 



Scoccimarro et al. I (120071 )) 



bh{M, Zo, Zf) 



1 + 



2p/Sf{zo,Zf) 
l + [qi^'{zf)]P' 



(31) 
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where 5f{zo,Zf) = 5cD{zo)/D{zf), with D{z) the hnear perturbation growth function. In 
estimating the bias, it is sometimes simply assumed that the redshift of formation and 
observation coincide, Zo = zj, in which case Eqn. (^T]) can be substituted directly into 
Eqn. fl27l) to calculate the expected bias for a given cluster sample. However, this assumption 
is clearly not realistic. 

To treat the more plausible case Zf > Zo requires a prescription for the probability 
distribution of the redshift of fo rmation Zf for clusters of mass M observed at redshift Zo- 
This issue was first explored in (ILacey fc Cole Ill993l . Il994l ). where the formation time of a 
halo observed at redshift Zo is defined as the time when the most massive progenitor accreted 
a mass equal to half the final mass. They also provided relatively simple formulas to compute 
th e formation redshift d istribution based on the spherical collapse model. Here we make use 
of iGiocoli et al. I (120071 ). where an improved estimate of the formation time is proposed and 
compared to N-body simulations. We consider the rescaled probability for the redshift of 
formation Zf of a. halo of mass M observed at redshift Zo to be given by 



p{uj) =2uj erfc(ct;/v2) , 



(32) 



where 



(33) 



^a^{M/2,Zo)-a^{M,Zo) ' 

and where, as above, q = 0.707. The bias for halos of mass M observed at redshift Zo is then 
given by 



bh{M, Zo) 



bhiM,Zo,Zf) X 



p[u;{M, Zo, Zf)] — '—dz 



(34) 



where the function bh{M, Zo, Zf) is given by Eqn. ( 1311) . and we assume Zo = 0.22 as the mean 
redshift of observation for the cluster sample. 



13.5 14 14.5 13.5 14 14.5 

l°g,o MJ] log,„ [M„y(/i-> MJ] 



Fig. 14. — Bias vs. halo virial mass. Points indicate data from the cluster correlation 
function using the full separation range r = 20 — 195 Mpc. Cluster richness has been 
translated into halo virial mass using the weak lensing relation of Eqn. (l35l) . The smaller 
(black) horizontal error bars correspond to the weak lensing meas urement uncertainty o n 
the constant of proportionality Mmr\2o in the mass-richness relation fj Johnston et al. 1120071 ). 



while the larger (red) horizontal error bars also include a 50% scatter in halo mass at fixed 
richness. The position of the data point itself is the mean value in both cases; the inferred 
mass is slightly smaller when the intrinsic scatter is taken into account. Solid curve shows 
halo bias prediction, bh{M, z), from the Halo Model, using the mean cluster redshift z = 0.22. 
Left panel: assuming erg = 0.9; Right panel: = 0.8. 
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Fig. [13] shows the predicted HM values for the cluster bias as a function of richness 
and compares them with the values of the bias parameters obtained from the correlation 
function analysis (Table [7]). The dashed curve corresponds to the approximation Zo = zj, 
while the continuous curve uses the more realistic treatment of Eqn. flMl) . The shaded areas 
around the curves represent the errors on these predictions assuming an uncertainty in the 
survey volume of 20%. The right panel of Fig. [13] shows the HM results with a lower power 
spectrum amplitude than our fiducial value, Ug = 0.8. The HM prediction, including the 
treatment of the formation redshift, appears to be in satisfactory agreement with the cluster 
correlation function measurements, especially for the lower value of as- We also note that our 
result for the A^2oo > 10 sample is in good agreement with the valu e for the bias p arameter 
obtained from the power spectrum analysis of the same sample in iHuetsi I ( 120071 ). that is, 
b = 3(cT8/0.85). 



In Fig. [131 the HM prediction for halo bias vs. mass has been translated to bias vs. 
richness by using the number density matching condition in Eqn. (1281) . As noted above, the 
halo mass for given richness derived from this procedure is a factor of two higher than tha t 



inferred more directly from statistical weak lensing measurements ( [Johnston et al. I l2007l ) 



which suggests that the HM curves in Fig. [12] should perhaps be shifted horizontally to the 
right by about a factor of two in iV2oo (since halo mass is close to linear in richness). A 
plausible cause o f this mismatch is the neglect of the large scatter (~ 50%) in the mass- 
richness relation (iRozo et al. Il2007bl ) when matching the number density of clusters to the 
theoretical density of massive halos. 

To address this problem, we can dispense with the number- density matching and instead 
make use of the weak lensing results to translate optical cluster richness to halo virial mass, 
i.e., we can translate the cluster correlation function measurements into a measure of bias vs. 
halo mass, with a minimum of theoretical assumptions. Those results can then be compared 
directly with the HM predictions for bh{M). Using statistical weak lensing measurements, 
Johnston et al. I (l2007l ) found the mean relation between halo virial mass and cluster richness 
to be 

A^200 



200) 



vir \20 



20 



(35) 



where M^ir\2o = 1-1 x 10^^ Mq and = 1.29. The combined statistical plus systematic 
error on Mmr\2o is about 17%; the corresponding error on the exponent aN is about 3%. 
However, as noted above, the intrinsic scatter in the mass at fixed richness is larger, of order 
50%. Including that scatter, we can calculate the distribution of halo masses, P{Mmr)i for 
each of the four cluster richness samples. For each sample, we use the mean of the P{Mmr) 
distribution as the effective mean halo mass and the 68% confidence interval to denote the 



spread in mass. In recent work iRozo et al.l (120081 ) have shown that the mass calibration 
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in Eqn. fl35|) has a bias that can be corrected by boosting the virial masses by 18%, this 
correction is apphed in the results presented here. 

The resuhing bias vs. mass relation inferred from the cluster correlation function and 
weak lensing mass-richness calibration is shown by the data points in Fig. [TUfor erg = 0.8 
and 0.9. For comparison, the linear halo bias vs. mass relation from HM is shown by the 
solid curve. The measured cluster bias at the central effective halo mass for each richness 
sample appears to be about ~ 15 — 20% higher than the HM prediction, with a slightly lower 
discrepancy for lower erg. This is consistent with the results of §5.11 To further illustrate this 
trend, in Fig. [15] we show the same comparison for ag = 0.7, where the agreement between 
the measurements and the HM is further improved. The fact that the measured bias is 
slightly higher th an that predicted from the HM could be an indicator of halo assembly bias 



(IWu et al. Il2008h . 
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Conclusions 



We have measured the large-scale 3D correlation function for four richness samples in the 
MaxBCG catalog of optically selected galaxy clusters from the SDSS, currently the largest 
cluster catalog available. Since the cluster redshifts in this sample were estimated photo- 
metrically, our modeling of the observed correlation function includes a careful treatment of 
the impact of photo-^ errors. The geometric approach to photo- 2; errors we have introduced 
should be of broad utility in the analysis of future large photometric galaxy surveys. 

On scales r = 20 — 60 Mpc, the cluster correlation function is well fit by a power law 
in separation, with a correlation scale i?o that increases with cluster richness. We have deter- 
mined the relation between the correlation scale Rq and the mean cluster separation d, finding 
qualitative agr e emen t with the compilation of previous cluster measurements presented in 



Bahcall et al. I (120031 ). The sc aling of -Rn with d is also consistent with that predicted in 
N-body simulations of ACDM flYounger et al. II2005I ). but with a slightly (10 — 15%) higher 



value of Rq at fixed d. 

We have modeled the large-scale correlation function on scales r = 20 — 195 Mpc 
using a non-linear model of the ACDM power spectrum that includes the effects of non-linear 
damping of the baryon acoustic peak. Non-linear damping, coupled with the estimated 
photo-2; errors, imply that we do not expect a robust detection of the BAO feature in 
these samples. Indeed, we find that the data set does not yield a clear detection of baryon 
acoustic features in the correlation function: for the largest sample considered, A''2oo > 10, 
the significance for the best-fit BAO model with respect to a featureless model is about 
1.4 — 1.7(7, depending on whether the covariance matrix is determined using the jackknife 
procedure or linear perturbation theory. 

Comparison of the clustering on scales less than and greater than r ~ 60 /i"^ Mpc 
provides weak evidence that the clustering amplitude for the ^ 10 and 11 samples 

is larger on large scales than on small scales, relative to a ACDM power spectrum. Such 
a suggestion of extra large -scale power has a l so been seen in the c l uster ing of the SDSS 



photometric LRG sample flBlake et al. 1 120071 : iPadmanabhan et al. 1120071 ). but statistical 



confirmation will require samples covering larger volumes. 

Finally, we have combin ed the clustering meas urements with weak lensing calibration 
of the mass-richness relation (j Johnston et al. 1120071 ) to directly infer the bias as a function 
of halo mass, a fundamental quantity in studies of structure formation. Again, the trend of 
increasing bias with halo mass is qualitatively consistent with the predictions of ACDM, but 
the amplitude of the bias is ~ 15 — 20% higher than the model prediction. This disagreement 
is reduced for lower values of the power spectrum amplitude ag. Given the large intrinsic 
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scatter in the relation between halo mass and cluster optical richness, however, as well as 
uncertainty in the photo- 2; error dispersion cx^, it is not yet clear whether this is a significant 



discrepancy. Moreover, an elevated cluster bias could be a sign of assembly bias (jWu et al. 
2008h . 
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